Distributed Power-law Graph Computing Distributed Power-law Graph Computing: Theoretical and Empirical Analysis∗

نویسندگان

  • Cong Xie
  • Ling Yan
  • Zhihua Zhang
چکیده

Typically, a large-scale natural graph follows a skewed power law. In distributed graphstructured computations, the skewness usually makes a bad partitioning, which leads to high communication cost and workload imbalance. Therefore, graph partitioning (GP) is a challenging issue. To tackle this challenge, we introduce degree-based techniques into GP via vertex-cut. Accordingly, we develop a novel GP approach called PowerLore. PowerLore is attractive because it naturally exploits the skewed degree distribution. In addition, we conduct the theoretical analysis. And experiments on several large skewed graphs further show that our PowerLore outperforms the state-of-the-art baselines in both decreasing communication costs and guaranteeing good balance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distributed Power-law Graph Computing: Theoretical and Empirical Analysis

With the emergence of big graphs in a variety of real applications like social networks, machine learning based on distributed graph-computing (DGC) frameworks has attracted much attention from big data machine learning community. In DGC frameworks, the graph partitioning (GP) strategy plays a key role to affect the performance, including the workload balance and communication cost. Typically, ...

متن کامل

Matrix Completion from Power-Law Distributed Samples

The low-rank matrix completion problem is a fundamental problem with many important applications. Recently, [4],[13] and [5] obtained the first non-trivial theoretical results for the problem assuming that the observed entries are sampled uniformly at random. Unfortunately, most real-world datasets do not satisfy this assumption, but instead exhibit power-law distributed samples. In this paper,...

متن کامل

Use of Structure Codes (Counts) for Computing Topological Indices of Carbon Nanotubes: Sadhana (Sd) Index of Phenylenes and its Hexagonal Squeezes

Structural codes vis-a-vis structural counts, like polynomials of a molecular graph, are important in computing graph-theoretical descriptors which are commonly known as topological indices. These indices are most important for characterizing carbon nanotubes (CNTs). In this paper we have computed Sadhana index (Sd) for phenylenes and their hexagonal squeezes using structural codes (counts). Sa...

متن کامل

A new Shuffled Genetic-based Task Scheduling Algorithm in Heterogeneous Distributed Systems

Distributed systems such as Grid- and Cloud Computing provision web services to their users in all of the world. One of the most important concerns which service providers encounter is to handle total cost of ownership (TCO). The large part of TCO is related to power consumption due to inefficient resource management. Task scheduling module as a key component can has drastic impact on both user...

متن کامل

Storage and Processing Systems for Power - Law Graphs

Large graphs abound around us – online social networks, Web graphs, the Internet, citation networks, protein interaction networks, telephone call graphs, peer-to-peer overlay networks, electric power grid networks, etc. Many reallife graphs are power-law graphs. A fundamental challenge in today’s Big Data world is storage and processing of these large-scale power-law graphs. In this thesis, we ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014